(54) Image encoding using object shape information



(57) An image frame signal encoding method encodes a
shape signal of an object in a digital image frame
signal, decodes the encoded shape signal to provide a
reconstructed shape signal, and converts background
pixel values in the digital image frame signal to
extension values to thereby provide an extension image
frame signal. The method then detects object blocks
including one or more reconstructed object pixels
therein and encodes a smallest process block having a
dimension of L x L pixels and including all the
reconstructed object pixels within each object block.
Description

Field of the Invention 

The present invention relates to a method and an
apparatus for encoding an image signal at a low bit-rate;
and, more particularly, to a method and an apparatus for
encoding an image signal having an object by utilizing
the shape of the object.

Description of the Prior Art 

In various electronic applications such as high
definition TV and video telephone systems, a video signal
may be transmitted in a digital form. When the video
signal comprising a sequence of video "frames" is
expressed in a digital form, a substantial amount of
digital data results, since each line of a video frame is
defined by a sequence of digital data elements referred
to as "pixels". Since, however, the available frequency
bandwidth of a conventional transmission channel is
limited, a video signal encoding method is normally used
to compress the digital data so that it can be
transmitted through the fixed channel.

One of such methods for encoding image signals
for a low bit-rate encoding system employs the so-called
object-oriented analysis-synthesis coding technique
(see Michael Hotter, "Object-Oriented Analysis-Synthesis
Coding Based On Moving Two-Dimensional Objects",
Signal Processing: Image Communication, 2,
409-428 (1990)).

According to the object-oriented analysis-synthesis
coding technique, an input image signal, which has
moving objects, is divided into the objects; and three
sets of parameters, i.e., those for defining motion,
contour and pixel data of each object, are processed
through different encoding channels.

In case of processing image data or pixels lying
within an object, a transform coding technique for
reducing spatial redundancies contained in the image
data may be employed in the object-oriented
analysis-synthesis coding technique. One of the most
frequently used transform coding techniques for image
data compression is a DCT (discrete cosine transform)
based block transformation coding technique, which
converts a block of digital image data, for example, a
block of 8 x 8 pixels, into a set of transform
coefficient data. This method is described in, e.g.,
Chen and Pratt, "Scene Adaptive Coder", IEEE
Transactions on Communications, COM-32, No. 3,
pp. 225-232 (March 1984).

In the DCT based block transformation coding
technique, a background or non-object region within a
block is filled with, e.g., 0, an average pixel value or
a mirror image of an object region in the block, and
then transformation is carried out. Fig. 1A shows an
object region and a background region in the block.
Referring to Figs. 1B and 1C, conventional methods for
filling the background region are illustrated for a
1-dimensional case. Specifically, in Fig. 1B, the
background region is filled with 0; and in Fig. 1C, the
background region is filled with an average pixel value
in the object region.
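
The two conventional fills of Figs. 1B and 1C can be
sketched for a single line of pixels as follows. This is
only an illustration: the function name fill_background,
the boolean mask and the mode strings are hypothetical
and do not appear in the cited prior art.

```python
def fill_background(line, mask, mode="zero"):
    """Fill background positions of a 1-D pixel line.

    line: pixel values; mask: True where the pixel belongs to the object.
    mode "zero" mimics Fig. 1B, mode "mean" mimics Fig. 1C (assumed names).
    """
    object_values = [v for v, m in zip(line, mask) if m]
    if mode == "zero" or not object_values:
        fill = 0
    elif mode == "mean":
        fill = sum(object_values) / len(object_values)
    else:
        raise ValueError("unknown mode: " + mode)
    # Object pixels are kept; only background positions are overwritten.
    return [v if m else fill for v, m in zip(line, mask)]


line = [0, 0, 4, 6, 8, 0]
mask = [False, False, True, True, True, False]
zero_filled = fill_background(line, mask, "zero")
mean_filled = fill_background(line, mask, "mean")
```

Either fill keeps a full-sized block usable by a standard
DCT, which is the advantage discussed next, but the
filled values themselves carry no image information.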

Even though these methods have the advantage of
being able to utilize two-dimensional DCT blocks used
in conventional methods (such as Joint Photographic
Experts Group: JPEG, Moving Pictures Experts Group:
MPEG, H.261, etc.), they also introduce unnecessary or
undesirable data in the background region of the image
and are, therefore, inefficient from a data compression
point of view.

Summary of the Invention

It is, therefore, a primary object of the present
invention to provide a method and an apparatus for
effectively encoding an image frame signal having an
object by utilizing the shape of the object, thereby
improving the data compression efficiency thereof.

In accordance with one aspect of the present
invention, there is provided a method for use in an
encoder of a digital image frame signal having an
object, wherein the digital image frame signal is
divided into a plurality of equal-sized blocks of N x N
pixels and includes object pixels contained in the
object and background pixels located outside thereof, N
being a positive integer, the method comprising the
steps of:

(a) encoding a shape signal of the object in the
digital image frame signal, the shape signal including
data for the size, position and contour of the object;

(b) decoding the encoded shape signal to provide a
reconstructed shape signal;

(c) converting the values of the background pixels
in the digital image frame signal to extension values
obtained by using the values of the object pixels, to
thereby provide an extension image frame signal;

(d) forming a reconstructed object in the extension
image frame signal based on the reconstructed
shape signal and detecting object blocks including
one or more reconstructed object pixels therein,
each of the reconstructed object pixels representing
a pixel included in the reconstructed object;

(e) forming a smallest process block having a
dimension of L x L pixels and including all the
reconstructed object pixels within each object
block, L being a positive integer; and

(f) encoding the smallest process block.

Brief Description of the Drawings

The above and other objects and features of the
present invention will become apparent from the
following description of preferred embodiments given in
conjunction with the accompanying drawings, in which:

Figs. 1A to 1C represent different methods for filling
a background region;

Fig. 2 provides a block diagram illustrating a digital
image frame signal encoding apparatus in accordance
with the present invention; and

Figs. 3A to 3D show the conventional repetitive
padding technique.

Detailed Description of the Preferred Embodiments

A digital image frame signal has a plurality of
equal-sized blocks, wherein a typical size of a block
ranges between 8 x 8 and 32 x 32 pixels. The digital
image frame signal having an object includes object
pixels contained in the object and background pixels
located outside the object. The background pixels may be
represented by pixels whose values are much larger or
smaller than those in the range of the ordinary pixel
values.

Referring to Fig. 2, there is shown a block diagram
of an apparatus 20 for encoding a digital image frame
signal in accordance with the present invention. The
encoding apparatus 20 comprises first and second
coding channels 200 and 500, a padding device 300, a
process block generator 400 and a formatting circuit
600. The first coding channel 200 includes a shape
encoder 210 and a shape decoder 220, and the second
coding channel 500 includes a transform coder 510, a
quantizer 520 and an entropy coder 530.

As shown in Fig. 2, a shape signal including data for
the size, position, and contour characterizing the shape
of an object in the digital image frame signal is provided
from a shape detector (not shown) to the first coding
channel 200 to be coded therein. At the shape encoder
210, the shape signal is first encoded by using, e.g., a
binary arithmetic code of JPEG (Joint Photographic
Experts Group), and then the encoded shape signal is
supplied to the shape decoder 220 and the formatting
circuit 600. The shape decoder 220 converts the
encoded shape signal back into a reconstructed shape
signal, which is provided to the process block
generator 400.

In the meantime, the padding device 300 performs
a padding process on the digital image frame signal
inputted thereto by using a conventional repetitive
padding technique to convert it to an extension image
frame signal, to thereby improve the data compression
efficiency at the second coding channel 500. Referring
to Fig. 3A, there is shown a digital image frame,
wherein the black region represents an object region.
The values of the pixels on the contour of the object as
shown in Fig. 3A are extended in the horizontal and the
vertical directions to fill the background region as
shown in Figs. 3B and 3C, respectively, wherein the
order of performing the horizontal and the vertical
extensions may be decided according to image
characteristics. The horizontal extension may be
performed on a row-by-row basis, while the vertical one
may be performed on a column-by-column basis. The
background regions left unfilled after the horizontal
and vertical extensions as shown in Fig. 3D may be
filled by using pixel values of the horizontally and
vertically extended regions. The extension image frame
signal obtained from the padding device 300 is provided
to the process block generator 400.
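
A minimal sketch of one possible repetitive padding
variant is given below. It is an illustration under
stated assumptions, not the padding device 300 itself:
the horizontal pass is run first, the vertical pass then
extends from every pixel already filled (which also
covers the leftover regions of Fig. 3D), and a background
pixel lying between two filled pixels takes their integer
average. The function names pad_line and repetitive_pad
are hypothetical.

```python
def pad_line(values, filled):
    """Extend filled pixel values along one row or column."""
    known = [i for i, f in enumerate(filled) if f]
    if not known:
        # Nothing to extend from on this line; leave it for a later pass.
        return list(values), list(filled)
    out = list(values)
    for i in range(len(values)):
        if filled[i]:
            continue
        left = max((j for j in known if j < i), default=None)
        right = min((j for j in known if j > i), default=None)
        if left is None:
            out[i] = values[right]
        elif right is None:
            out[i] = values[left]
        else:
            # Assumption: average the two nearest filled neighbours.
            out[i] = (values[left] + values[right]) // 2
    return out, [True] * len(values)


def repetitive_pad(frame, mask):
    """Horizontal pass row by row, then vertical pass column by column."""
    passes = [pad_line(row, m) for row, m in zip(frame, mask)]
    vals = [p[0] for p in passes]
    fill = [p[1] for p in passes]
    height, width = len(vals), len(vals[0])
    for x in range(width):
        column = [vals[y][x] for y in range(height)]
        flags = [fill[y][x] for y in range(height)]
        padded, _ = pad_line(column, flags)
        for y in range(height):
            vals[y][x] = padded[y]
    return vals


frame = [[0, 0, 0, 0], [0, 10, 20, 0], [0, 30, 0, 0], [0, 0, 0, 0]]
mask = [[False] * 4,
        [False, True, True, False],
        [False, True, False, False],
        [False] * 4]
extension_frame = repetitive_pad(frame, mask)
```

Object pixels are never modified, so the decoder can
discard the padded values using the shape signal alone.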

The process block generator 400 first forms a
reconstructed object in the extension image frame
based on the reconstructed shape signal provided from
the shape decoder 220 and detects object blocks
including one or more reconstructed object pixels
therein, wherein a reconstructed object pixel
represents a pixel included in the reconstructed object.
The process block generator 400 then forms a smallest
process block including all the reconstructed object
pixels within each object block and having a dimension
of L x L pixels, L being a positive integer. The smallest
process block is provided to the transform coder 510 in
the second coding channel 500.
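
The detection of object blocks and the forming of a
smallest process block can be sketched as below. The
text does not say where the L x L square is placed
within the object block, so this sketch assumes it is
anchored at the top-left corner of the bounding box of
the object pixels and shifted only as far as needed to
stay inside the N x N block. The function names are
hypothetical.

```python
def object_blocks(mask, n):
    """List (block_row, block_col) of N x N blocks holding an object pixel."""
    rows, cols = len(mask), len(mask[0])
    found = []
    for by in range(0, rows, n):
        for bx in range(0, cols, n):
            if any(mask[y][x]
                   for y in range(by, min(by + n, rows))
                   for x in range(bx, min(bx + n, cols))):
                found.append((by // n, bx // n))
    return found


def smallest_process_block(block_mask):
    """Return (top, left, L) of the smallest square covering all object pixels.

    block_mask is the N x N mask of one object block.
    """
    n = len(block_mask)
    ys = [y for y in range(n) for x in range(n) if block_mask[y][x]]
    xs = [x for y in range(n) for x in range(n) if block_mask[y][x]]
    if not ys:
        return None  # not an object block
    side = max(max(ys) - min(ys) + 1, max(xs) - min(xs) + 1)
    # Assumption: shift the square up or left so it stays inside the block.
    top = min(min(ys), n - side)
    left = min(min(xs), n - side)
    return top, left, side


frame_mask = [[True, False, False, False],
              [False, False, False, False],
              [False, False, False, False],
              [False, False, False, True]]
blocks = object_blocks(frame_mask, 2)

block_mask = [[False] * 8 for _ in range(8)]
for x in range(4):
    block_mask[7][x] = True  # a thin object along the bottom row
process_block = smallest_process_block(block_mask)
```

With L smaller than N, fewer transform coefficients need
to be coded for blocks that the object only partly
covers.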

The transform coder 510 transforms the image signal
of the smallest process blocks in the spatial domain
from the process block generator 400 into a set of
transform coefficients in the frequency domain by
employing, e.g., a discrete cosine transform (DCT) and
provides the set of transform coefficients to the
quantizer 520. At the quantizer 520, the set of transform
coefficients is quantized by using a known quantization
method; and then the set of quantized transform
coefficients is fed to the entropy coder 530 for further
processing.
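
As an illustration of the transform and quantization
steps, the following sketch applies an orthonormal
two-dimensional DCT-II to an L x L block and then a
uniform quantizer. The uniform step size is an
assumption; the text only refers to "a known
quantization method". The function names dct_2d and
quantize are hypothetical.

```python
import math


def dct_2d(block):
    """Orthonormal 2-D DCT-II of a square block (direct O(L^4) form)."""
    n = len(block)

    def scale(k):
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    coeffs = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            total = 0.0
            for y in range(n):
                for x in range(n):
                    total += (block[y][x]
                              * math.cos((2 * y + 1) * u * math.pi / (2 * n))
                              * math.cos((2 * x + 1) * v * math.pi / (2 * n)))
            coeffs[u][v] = scale(u) * scale(v) * total
    return coeffs


def quantize(coeffs, step):
    """Uniform quantizer (assumed): divide by the step size and round."""
    return [[int(round(c / step)) for c in row] for row in coeffs]


flat_block = [[10] * 4 for _ in range(4)]
quantized = quantize(dct_2d(flat_block), 8)
```

A smoothly padded block concentrates its energy in a few
low-frequency coefficients, so most quantized values
become zero.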

The entropy coder 530 encodes the set of quantized
transform coefficients from the quantizer 520 by
using, e.g., a combination of run-length and variable
length coding techniques to generate an encoded
image frame signal. The encoded image frame signal
from the entropy coder 530 is then provided to the
formatting circuit 600.
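
The run-length part of such an entropy coder can be
sketched as follows. The zigzag scan order and the "EOB"
end-of-block marker are assumptions borrowed from common
DCT coders and are not specified in the text; the
variable length coding of the resulting (run, level)
pairs is omitted. The function names are hypothetical.

```python
def zigzag(block):
    """Read a square block along anti-diagonals in zigzag order."""
    n = len(block)
    order = sorted(
        ((u, v) for u in range(n) for v in range(n)),
        # Odd diagonals run top to bottom, even diagonals bottom to top.
        key=lambda p: (p[0] + p[1],
                       p[0] if (p[0] + p[1]) % 2 else -p[0]),
    )
    return [block[u][v] for u, v in order]


def run_length(sequence):
    """Encode as (zero_run, level) pairs; trailing zeros collapse to "EOB"."""
    pairs, run = [], 0
    for level in sequence:
        if level == 0:
            run += 1
        else:
            pairs.append((run, level))
            run = 0
    pairs.append("EOB")
    return pairs


quantized_block = [[5, 0, 0],
                   [2, 0, 0],
                   [0, 0, 1]]
scanned = zigzag(quantized_block)
symbols = run_length(scanned)
```

Each (run, level) pair would then be mapped to a
variable length codeword, with shorter codewords given
to the more frequent pairs.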

The formatting circuit 600 formats the encoded
shape signal from the shape encoder 210 in the first
coding channel 200 and the encoded image frame
signal from the entropy coder 530 in the second
coding channel 500, to thereby provide a formatted
digital image frame signal to a transmitter (not shown)
for the transmission thereof.

While the present invention has been described
with respect to the particular embodiments, it will be
apparent to those skilled in the art that various changes
and modifications may be made without departing from
the scope of the invention as defined in the following
claims.

Claims

1. A method for use in an encoder of a digital image
frame signal having an object, wherein the digital
image frame signal is divided into a plurality of
equal-sized blocks of N x N pixels and includes
object pixels contained in the object and background
pixels located outside thereof, N being a
positive integer, the method comprising the steps
of:

(a) encoding a shape signal of the object in the
digital image frame signal, the shape signal
including data for the size, position and contour
of the object;

(b) decoding the encoded shape signal to provide
a reconstructed shape signal;

(c) converting the background pixel values in
the digital image frame signal to extension values
obtained by using the object pixel values,
to thereby provide an extension image frame
signal;

(d) forming a reconstructed object in the extension
image frame signal based on the reconstructed
shape signal and detecting object
blocks including one or more reconstructed
object pixels therein, each of the reconstructed
object pixels representing a pixel included in
the reconstructed object;

(e) forming a smallest process block having a
dimension of L x L pixels and including all the
reconstructed object pixels within each object
block, L being a positive integer; and

(f) encoding the smallest process block.

2. An apparatus for encoding a digital image frame
signal having an object, wherein the digital image
frame signal is divided into a plurality of equal-sized
blocks of N x N pixels and includes object pixels
contained in the object and background pixels
located outside thereof, N being a positive integer,
which comprises:

means for encoding a shape signal of the
object in the digital image frame signal, the
shape signal including data for the size, position
and contour of the object;

means for decoding the encoded shape signal
to provide a reconstructed shape signal;

means for converting the background pixel values
in the digital image frame signal to extension
values obtained by using the object pixel
values, to thereby provide an extension image
frame signal;

means for forming a reconstructed object in the
extension image frame signal based on the
reconstructed shape signal and detecting
object blocks including one or more reconstructed
object pixels therein, each of the
reconstructed object pixels representing a pixel
included in the reconstructed object;

means for forming a smallest process block
having a dimension of L x L pixels and including
all the reconstructed object pixels within each
object block, L being a positive integer; and

means for encoding the smallest process
block.
FIG. 1A

FIG. 3A